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ABSTRACT 

Simple measures of reading, spelling, and written 
expression wei;e administered to 566 elementary students from the 
states of Minnesota, Pennsylvania, and Washington. Each child was 
individually administered three reading word lists, three reading 
passages, two dictated spelling lists, and two story starters. 
Approximately 25 minutes was needed for testing, which was performed 
in the fall and spring. Measures of correctness appropriately 
reflected developmental changes from grade to grade and from fall to 
spring within grades. Differences in mean levels of performance were 
obtained for students from different states, indicating that state or 
school district norms may be more appropriate that national norms 
when using the measures for decision making. (Author/PN) 
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Abstract 

Simple measures of reading, spelling, and written expression were 
administered to 566 elementary students from three states. Measures 
of correctness appropriately reflected deve lopmental changes from 
grade to grade and from fall to spring within grades. Differences in 
mean levels of performance were obtained for students from different 
states, indicating that state or school district norms may be more 
appropriate that national norms when using the measures for decision 
making* Implications of the results for the use of simple measures in 
the classroom are discussed. 



The Use of Standard Tasks to Measure Achievement in Reading,; 
Spelling, and Written Expression: A Normative and Developmental Study 

Epstein (1980) argued for the importance of aggregation in 
psychological research as a means of establishing the stability and 
general ity of findings. Aggregation over occasions, one of four 
facets of aggregation that Epstein described, involves repeated 
assessments of a single behavioral item, thereby cancelling out the 
incidental effects associated with each particular assessments 
Aggregation over occasions increases both the temporal ,reliabi 1 ity or 
stabi 1 ity of findings and their general i ty over occasions. An the 
measurement ' of academic performance, the concept of .aggregation over 
Occasions implies repeated assessments of well-defined academic 
skills. Repeated assessments of defined skills contrast with typical 
measurement practice--the administration of an achievement test 
battery at the beginning and end of the school year. The former 
generates data that are stable and general izable because the data are 
aggregated across occasions; the stability and generality of the 
latter derive from aggregation across stimuli. In recent years, 
instructional methodologies have been developed that capitalize on the 
stability and generality inherent in repeated assessments of academic 
skills (Deno & Mirkin, 1977; White & Haring, 1980). 

These instructional methodologies may be conceptualized as 
systems for making decisions about the effectiveness of teaching 
strategies. The systems involve an operational statement of academic 
skills, frequent if not daily performance samples, goal setting, and 
decision making based upon comparisons of a ch i Id 1 s performance to the 
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progress necessary 4o achieve the goal. Clearly, reasonable and 
appropriate goals are prerequisite to valjd decision making. • Goals 
(or "aims" as they are known in Precision Teaching terminology) are 
determined by specifying a desired level of performance or progress 
* and a date on which the goal is to be achieved. The research 
described in this "paper is an effort to' establ ish normative 
performance levels for elementary- age students on standard 
instructional tasks. In order to clarify the distinction between 
standard instructional tasks and grade level tasks, it is necessary (o 
consider how goals have been established previously'. «> 
Sources of Aims and Goals 

Traditionally, goals have been expressed in terms of .a child's 
level of functioning in grade level material. Starlin (1979), for 
example, suggested that fourth, fifth, and sixth grade readers must 
read at 125 words per minute or better with two or fewer errors in 

* 

order make satisfactory progress in their subsequent grade. This 
criterion was based upon the performance of hundreds of children/and 
the effort required to substantiate this standard was remarkable. 
Nevertheless, there are two shortcomings with this grade level 
approach which weigh against its wholesale adoption and use. First, 
the assumpt ion that the difficulty of a reading series increases 
uniformly as a function of grade level can be challenged on the basis 
of recent research (Fuchs, Fuchs, & Deno, 1981). 

Second, by maintaining a standard criterion and allowing the 
difficulty of the material to change, the importance of improvement is 
minimized. When acceptable performance is defined as a standard 

ERIC 



fluency and accuracy criterion, average readers at all grade levels 
would be functioning at approximately the same level of proficiency* 
The differences among average readers would lie in the difficulty of 
the material, not in different levels of performance. Average readers 
making adequate progress would not be expected to change their level 
of performance; presumably, they would read at 125 words per minute, in 
each increasingly difficult story in the text* 

An alternative to setting goals jn which task difficulty changes 
and performance is held constant is to hold task difficulty constant 
and specify Jncre^ses in performance as the desired outcome. By using 
a standard task, differences in performance across and within grade 
levels are emphasized, rather t than differences in task difficulty* 
Growth and changes in child performance become the .criteria against 
which the effectiveness of teaching programs is measured* 
Rationale for the Use of a Standard Task 

In measuring changes 'in a child's physical stature, the same^ 
scale is used to determine weight and the same rule is used to, 
determine height regardless of the, child's age. As a result, 
development scales for height and weight are Constructed readily. In 
assessing a child's reading development, teachers and psychologists 
rely upon measures with scales that differ from grade level to grade 
level. Vf we were to measure weight in the same way, we would need a 
differenUscale for each year of life; each scale would be calibrated 
so that average chi Idren at each age would weigh the same. 
Differences among children of various ages would be a function of the 
calibration of the scale in the same way that differences among 
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readers are a function of the difficulty of the material when the 
standard of proficiency at grade level is, used. 

If a standard task were used across the elementary grades, one 
obvious question is whether the task would discriminate among children 
at 'the extremes. For example, if difficulty were controlled at third 
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grade level, the material might be so difficult for beginning readers 
that good and poor readers alike would read with the same low 
proficiency. At the opposite extreme, we might anticipate a problem 
with readers at the fifth and sfxth grade levels. The standard task 
might fail to discriminate between good and poor readers at this level 
since good and poor readers alike will have mastered the material. 
Whether standard academic tasks will differentiate among children at 
widely varying levels of proficiency, however, is a question open to 
investigation. 
Purpose of the Study 

The first purpose of this study was to determine the feasibility 
c f using a standard task to measure the reading, spelling, and writing 
proficiency of elementary children. The primary question was whether 
standard tasks could be identified that would discriminate among 
students in the six elementary grades, ahtf whether those tasks would 
discriminate within grade levels at different times of the year. A 
second purpose of the study was to describe procedures for 
establishing local norms on the standard tasks. Presumably, if 
standard tasks could be identified, local districts would be able to 
establish norms so that children in need of special assistance could 
be readily screened and identified. 



Method 

In an effort to provide decision makers with information on how 
students perform on the standard tasks of reading (Deno, Mirkin, 
Chiang, & Lowry, 1980), spelling (Deno, Mirkin, Lowry, & Kuehnle, 
1980), and written expression (Deno, Mirkin, & Marston, 1980)^ these 
measur|£ were administered to a large sample of elementary students 
from three states: Minnesota, Pennsylvania, and Washington. 
Standard Tasks 

Reading , Reading materials developed by Deno, Mirkin/ Chiang, & 
Lowry (1980) were used to collect information on the typical reading 
performance of elementary students. Each student react aloud from 
three isolated word lists and three oral reading passages. 

The word lis-ts measure consisted of three alternative lists of 
140 words each that were randomly selected from the Core List of 5,167 
words 1 isted in Basic Elemontary Reading Vocabulary (Harris & 
Jacobson, 1972). The words were chosen randomly from those at levels 
pre-primer to grade three that had a frequency index of more than 10 
per million words in the Teachers 1 Handbook of 10,000 Words (Thorndike 
& Lorge, 1944). 

The passages measure included three passages of about 300 words 
each. They were selected from the third grade book from three 
different basal reading series: Allyn-Bacon, Ginn 720, and 
Houghton-Mifflin. Each passage consisted of the first part of the 
story. The Fry Readability Index formula (Fry, 1968) was used to 
ensure that each passage was at the third grade level. 

On both measures the examiner recorded the number of words 
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pronounced correctly and incorrectly. Subjects were given 60 seconds 

to read aloud from each stimulus material. For each subject two 

measures of correct reading and two measures of incorrect reading were 

* 

computed: Mean Number of Words Read Correctly from Word Lists, Mean 
Number of Words Read Correctly from Passages, Mean Number of Words- 
Read Incorrectly from Word Lists, and Mean Number cf Words Read 
Incorrectly from Passages. 

Spell inq . The measurement of spelling performance in this study 
focused on two dictated spelling lists developed by Deno, Mirkin, 
Lowry, and Kuehnle (1980). Both lists were composed of randomly 
selected words from levels preprimer to grade three from Basic 
Elementary Reading Vocabularies (Harris & Jacobson, 1972). 

The examiners dictated words for three minutes for each' list 
while the subject wrote his or her responses. A maximum of 15 seconds 
was allowed for each word before the next word was presented. The 
spelling lists were scored for the number of correct letter sequences 
(see White & Haring, 1976) and the number of words spelled correctly 
and incorrectly. For each subject the mean on each measure was 
computed. 

Written Expression . The format developed by Deno, Mirkin, and 
Marston (1980) in the formative evaluation of written expression was 
adopted for this study. Each student was presented two Story Starters 
and given three ^minutes for each to write a -composition. Each 
students composition was then scored for the Mean Number of Tptal 
Words Written, Mean Number of Words Spelled Correctly, Mean Number* of 
Words Spelled Incorrectly,, and Mean Number of Letters Written in * 



Correct Sequence, 
Subjects 

A sample of 566 students enrolled in grades one to six from 
Minnesota, Pennsylvania, and Washington was administered the measures 
of reading, spelling, and written expression. All students were 
selected randomly* within the school districts that volunteered to 
participate in the study. There vjere 275 males and 291* females in the 
total sample. * Of the 566 students tested, there were 92 first 
graders, 85 second graders, 96 tht^d graders, 99 fourth graders, 101 
and 93 sixth graders. 

The Minnesota sample consisted of 134 of the 566 students, 63 
boys and 71 girls. Most of these subjects (73%) were selected from 
two urban areas with populations of 50,000 and 100,000 people. These 
elementary students were approximately equally distributed among 
grades 1 to 6. 

The Pennsylvania sample of students included 157 boys and 169 
girls, equally distributed across the six grade levels. These 
elementary students were randomly selected from t;wo areas (rural and 
urban) in Central Pennsylvania. The remaining 106 elementary students 
tested were from the Seattle, Washington area; 55 were male and 51 
were female. Again, the students were distributed approximately 
equally through grades 1 to 6. 

* • « 

Procedure 

Each child was administered the reading, spelling, and written 
expression measures duc/ng the fall and the spring on a'o individual 
basis by x an examiner trained in the administration of the measures*. 
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Testing lasted no longer than 25 minutes. 

The order of presentation of the materials was as follows: 

1. Three Reading Word Lists (1 minute each) 

2. Three Reading Passages (1 minute each) 

3. Two Dictated Spelling Lists (3 minutes each) 

4. Two Story Starters (3 minutes each) 

Results \ 
Al 1 data gathered on the 566 students were scored by four 
psychometric assistantsV Interscorer reliabilities for the reading, 
spelling, and written expression measures ranged from .96 to .99 
Grade Level Differences 

The mean scores on the four measures of reading performance for 
the fall and spring administrations are presented by grade level in 
Table 1. The mean scores on the three spelling measures are presented 
in the same way in T^able 2 and the mean scores on the four writing 
measures are presented in Table 3. 



Insert Tables 1-3 about here 



r the seven measures of correctness and fluency, an increase in 
mean performance was observed as a function of grade level, for both 
the fall and spring administrations. With minor exceptions, for the 
measures of incorrect responding, there was a decrease in mean number 
of errors with increasing grade level. The most frequent exceptions 
to this occurred for Words Spelled Incorrectly in written expression. 
To test the hypothesis that performance would vary with grade, a 
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simple one-way analysis of variance was conducted for each simple 
measure, with grade level as the independent variable. Separate 
analyses were conducted for the fall and spring means. To check 
further the assumptions that the fluency and correct measures would 
increase and that the error measures would decrease with grade level, 
each measure was subjected to a test of linearity for the entire 
sample. Altogether, 22 ANOVAs and 22 tests of linearity were 
conducted: one for each of the 11 measures for both fall and spring. 

With regard to the main effects for grade, 21 of the 22 F values 
were significant at or below the .001 level of significance. The 
tests of linearity also suggested strong linear trends, with 20 of 22 
£ values significant at or below the .05 level. Twelve of these JF 
values were significant at or below the .001 level. Follow-up tests 
of the grade level main effects were conducted using the Tukey 
procedure; the results of these follow-ups are reported in Table 4 
using Duncan's (1955) underlining procedure. 



Insert Table 4 about here 



In order to establish a set of norms for .each of the direct 
measures, levels of performance were calculated for the 25th, 50th, 
and 75th percentiles for each measure for both fall and^pring. These 
data are presented in Figures 1-14. 



Insert Figures 1-14 about here 
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Annual Growth 

The extent to which students improved between the fall and spring 
administrations on the reading, spelling, and written expression 
measures was analyzed by conducting paired t_ tests at every grade 
level. The average difference scores (spring minus fall) for the 
seven measures of correct performance and their levels of significance 
are reported in Table 5; the difference scores for the four measures 
of incorrect performance and their levels of significance are repprted 
in Table 6. For the measures of correct performance, 41 of 42 t, 
values were statistically significant at or below the .05 level of 
probability, while 37 of these values were significant at or below the 
.001 level. For the error measures, 11 of 24 _t values were 
significant in the hypothesized direction at or below the .05 level of 
probability. Nine of these 11 significant differences were obtained 
on the reading measures. 



Insert Tables 5 and 6 about here 



In general, greater growth was found for the reading and spelling 
measures. Mean differences from fall to spring in correct performance 
on the reading word lists ranged from 9.2 words at fifth grade to 15.5 
at second grade. On the oral reading passage, the differences ranged 
from 11.5 words for the sixth graders to 25.1 words at first grade. 
The fall to spring increase in the number of words spelled correctly 
ranged from 4.2 words at fifth grade and 4.3 words at first grade to 
5.7 words at second and third grades, while the growth in the number 
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of letters ranged from 28.6 letters at first grade to 37.7 letters at 
second grade* On the total words written measure of written 
expression, the differences ranged from 1.7 words at sixth grade to 
8.9 words at second grade. For words written correctly, the range was 
from 3.7 (sixth grade) to 8.0 (second grade); for correct letter 
sequences written, the range was from 17.6 (second grade) to 34.4 
(first grade). 

In general, the mean number of errors on the four measures 
declined between fall and spring at oach grade level, although a 
smaller proportion of these differences was statistically significant 
than was the case for the correct measures. In fact, words written 
incorrectly proved to be insensitive to fall to spring differences at 
four grades when used as a measure of spelling and at all six grades 
when used as a measure of written* expression. 
Stability over Time 

The data presented in the preceding two sections showed 
significant differences in mean performance for all 11 measures across 
grade levels and for nine measures within grade levels from fall to 
spring. These data suggest that the simple and direct measures are 
sensitive to growth. Another issue is stability over time; it is 
possible that the performances of only a few individuals accounted for 
the mean differences on the direct measures. To address this issue, 
stability coefficients were computed by correlating individual 
performance in the fall and the spring. Significant differences 
between test sessions and moderate to high stability coefficients 
would indicate that most students improved within the school year. 
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Stability coefficients are presented in Table 7. A definite 
pattern emerges in the correct and error measures: The measures of 
correctness produced coefficients that were greater than those 
produced by the error measures. In addition, the measures of correct 
performance in reading proved to be the most stable; these 
coefficients ranged from .58 at first grade to .94 at fifth grade on 
the word list measure. For spelling, the coefficients for the 
measures of correct performance ranged from .21 at first grade to .86 
at s'econd grade on the words spelled correctly measure. Stability for 
the measures of written expression was somewhat lower; the 
coefficients ranged from .20 at first grade to .78 at sixth grade on, 
the words written correctly measure. 

Insert Table 7 about here 



With respect to grade level, the stability coefficients were 
fairly stable for all grades except first grade. The median stability 
coefficients of the seven correct measures for grades 1 to 6 were .41, 
•77, .68, .77, .64, and .78. 
State, Demographic, and Sex Differences 

Separate analyses were conducted to determine the extent to which 
the vari ance was affected by state, rural or urban setting of the 
sample, and the sex of the children. Of the 22 analyses conducted 
with state as the independent variable, 12 main effects were 
significant at the .05 level or below. The significant effects were 
obtained on measures of correct performance in reading, spelling, and 



9 

ERLC 



-». I 



13 

written expression. Differences were more common in the fall analyses 
than the spring analyses, and most favored the Minnesota sample over 
the Washington and Pennsylvania samples. 

Of the 22 analyses of urban, small urban, and rural differences, 
only four main effects proved significant at the .05 level or below. 
All of these differences occurred at the fall administration and three 
were obtained on correct measures of written expression. In general, 
the urban samples scored lower on these measures than either the small 
urban or rural samples. Interestingly, state accounted for a greater 
proportion of the variance in most of the measures than the urban- 
rural variable. 

With regard to sex of the student, hi of the 22 analyses yielded 
significant main effects at the .05 level or below. These significant 
differences appeared on measures of correct performance in readng, 
spelling, and written expression, for both fall and spring 
administrations. In all of these differences, the girls in the total 
sample outperformed the boys. The differences in the two spelling 
measures (words spelled correctly and letters spelled correctly) and 
the three measures of written expression (total words written, words 
written correctly, and letters written correctly) are predictable 
given the apparent interrelatedness of the measures. 

Discussion 

The results of the present study provide ample evidence in 
support of the hypothesis that standard tasks at constant levels of 
difficulty can be used to measure developmental changes in student 
performance. In fact, there was a remarkable regularity in all 
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measures of correct performance in reading, spelling, and writing— 
from grade to grade; from fall to spring within grades; and, in every 
instance, from spring to faft across adjacent grades. 

There was less regularity in the measures ' of incorrect 
performance. For example, in the fall at several grade levels, the 
children made more errors than would be predicted by the spring term 
error rates of the children in the next lower grade. This same 
pattern held true but to a lesser extent in spelling. A more 
interesting anomoly in the incorrect spelling measure was an increase 
in errors from grade 1 to grade 2. This increase may be explained as 
an increase in the proficiency of the children at the tool skill of 
writing words. Although total errors increased, the percent of words 
that the children spelled i ^ectly increased steadily from the grade 
1 fall administration to the grade 2 spring administration. This same 
pattern held in the measures of incorrect performance in written 
expression, and probably represents a valid developmental effect. 

The analyses summarized in Table 4 also illustrate that the 
measures of correct performance in reading, spelling, and written 
expression discriminated more clearly between grade levels than the 
measures of incorrect performance. In reading, incorrect performance 
on both the word lists and the passages was not differentiated by 
grade above grade 2; in spelling and written expression, the measures 
of incorrect performance failed to discriminate at any grade level. 
Both of these relationships hsld for both fall and spring 
administrations. It is interesting to note that when the measures of 
correct performance fai led to di scr iminate between adjacent grades , 
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grades 3, 4, and 5 almost always were involved* Since the grade 6 
performances were separated from the lower grades on al 1 but the 
spring administration of Total Words Written and Words Spelled 
Correctly, this lack of discriminabi 1 i try among grades 3, 4, and 5 
cannot be explained as a ceiling effect. 

In reading, performance on the word lists was as discriminating 
as performance on the passages, even though the latter enjoys greater 
credibility and face validity among teachers* Given that word lists 
are easier to construct and that by randomizing the order in which the 
words are presented, equivalent forms of the same list can be made, it 
would seem that lists might be preferred to passages. In spelling, 
both words correct and letter sequences correct proved equally 
discriminating. Given the relative ease of scoring correct words, the 
former measure would seem to be the measure of choice. However, in 
written expression, the outcome was somewhat different: the finer the % \ 

unit of analysis, the better its discriminaoil ity. Thus, whereas the 
Total Words Written measure failed to discriminate between grades 3 
and 4 in the fall, and grades 3 and 4 and grades 5 and 6 in the 
spring, the Words Spelled Correctly measure failed only to 
discriminate between grades 3 and 4 and grades 5 and 6 in the spring. 
Correctly Written Letter Sequences, on the other hand, discriminated 
between all adjacent grades for both the fall and spring 
administrations. 

Whelh coupled with the results of earlier research on the 
criterion validity of the tasks used in this study (Deno, Mirkin, 
Chiang, & Lowry, 1980; Deno, Mirkin, Lowry, & Kuehnle, 1980; Deno, 
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Mirkin, & Marston, 1980), the evidence that regular measurement of 
performance on standard tasks can be used to provide "vital signs" of 
educational growth is persuasive. Measuring ^performance on a standard 
task of constant difficulty 'facilitates comparisons that are not 
possible when different tasks of different difficulty are used to 
measure performance. For example, increases in reading, spelling, and 
writing proficiency are represented by increases in actual behavior 
counts rather than by relative standing within the peer group. And, 
since students from different grades or at different ages are all 
measured on the same task, comparisons in growth within and between 
individuals across time easily can be made on the same behavioral 
scale. It is clear that given the same amount of time and the same 
text passages, students who read more words correctly are more 
proficient readers. This statement is true regardless of whether the 
difference is between the scores of students at different ages or 
between scores for the same student at different ages . The same 
general conclusions hold for the standard tasks used in spelling and 
written expression. 

The conclusions are likely to be true only in general, however. 
We are sure to find students whose scores are the same but who 
function differently when measured on related tasks, or students whose 
scores are x dif ferent but who function the same on related tasks. In 
the face of such qualifications that are required of virtually all 
research on human behavior, .the conclusion that "production" or 
"correct output in fixed 'time" is a critical indicator of academic 
growth seems inescapable. 
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Turning to a second issue, the simplicity and economy of the 
standard tasks used in the research makes them ideal for routine use 
in the classroom. Teachers easily can obtain the behavior samples on 
a weekly basis as an integral part of their instruction. Doing so 
would permit, as Epstein (1980) argues, aggregation of data for 
purposes of establishing stability. Aggregation of repeated 
measurements surely will make possible reliable decisions about 
individuals that we cannot now make with scores from a single 
administration of a standard test. Further, with the potential of 
repeated measurement over time it becomes possible to make program 
evaluation decisions based on time series data that are not possible 
when pre and posttesting are used (Campbell, 1969). 

Finally, the data from the present research point to the 
possibility that typical growth rates on the standard tasks could be 
established and used to place the growth of individual students in a 
normative perspective. The growth curves obtained are quite smooth, 
and the distributions within grade levels have the technical 
characteristics required for making within-grade discriminations. 
The measurement scale is sufficiently refined to permit reliable 
discriminations from fall to spring and growth in that period is 
linear. Finally, while establishing national norms on standard tasks 
is possible, the differences obtained among states leads us to 
recommend that states and local school districts develop their own 
norms as a basis for decision making. 
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Table 1 



Mean Correct and Incorrect Performance on Two Reading Tasks by Grade Level at Fall and Spring 



Grade 



1 
2 
3 
4 
5 
6 



Word Lists 



Words Correct 



Fall 



Spr 



3.75 
25.50 
49.90 
61.32 
68.63 
83.53 



17.32 
41.02 
63.02 
71.57 
77.76 
90.71 



Words Incorrect 



Fall 



18.21 
12.72 
6.03 
4.73 
4.34 
3.80 



Spr 



12.34 
9.56 
4.64 
3.51 
3.31 
2.87 



Passages 

Words Correct Words Incorrect 



Fall 



Spr 



11.31 
57.22 
98.92 
113.93 
128.76 
147.17 



36.42 
81 .90 
119.18 
127.57 
142.25 
158.65 



Fall 



23.69 
9.88 
4.81 
4.05 
3.83 
4.23 



Spr 



8.94 
6.04 
3.95 
3.14 
3.53 
3.05 
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Table 2 






* 


Mean Correct 


and Incorrect Performance 


on the Spelling Task by Grade Level at' 


Fall and Spring 






Correct 


Words 


Incorrect 


Words 


Correct Letter Sequences 


Grade 


Fall 


Spr 


Fall 


Spr 


• Fall 


Spr 


' 1 


1.53 


5.86 


9.58 


9.54 


1 7 .70 


• 46.31 


2 


6.35 


12.01 


10.85 


10.12 


60.88 


98.58 


3 


13.96 


19.68 


8.97 1. 


7.96 . 


104.03 


138.95. ' 




18.67 










4 


26.65 


8.10 


6.62 


131 .04 


160.49 


5 


22.70 


"'26.98 


8.36 


6.45 


154.93 


185.51 


6 


27. j? 


32.16 


6.41 


5.53 


180.31 


211.73 


: ••• \ 
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Table 3 



. ^Mean Correct and Incorrect Performance on the Writing Task by Grade Leve\ at Fall and Sprang 

. ; » ■ lV . 























Total 


a Words 


Co rrprt 


n \J i vi o 


I ncor rect 


Wo rds 


Correct Letter Sequences 


Grade 


Fall 


Spr 


Fall 


Spr 


FalJ 


Spr 


Fall 


\ Spr ■ 


1 


7.84 


15.64 


5.01 


11 .59 


3.68 


-.4 .'84 


22.52 / 


r 55.25 


2 


19.67 


28.52 


15.62 


0 23.64 


4.12 


5.00 


73.43 . 


107.83 


3 


32.02 


37.04 


28.98 


34.19 


3.25 


3.06 


132.51 


1*5.1 .49 


4 


37.53 


4K38 


34.84 


38.96 


2.98 


2.71 


159.68 


178.85 


5 


43.98 


49.22 


41 .18 


46.87 


3.29 


2.74 


189.53 


214.52 


6 


52.09 


t 53.72 


47.38 


51.08 ; 


5.87 


3.30 


220.38 


240.02 ' 



t 
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Differences Among Grade Level Means on All Tasks at Fall 
and Spring Using Duncan's Underlining Notation 
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Measure 



Grade 



Reading 

Words Correct on Word Lists 
Words Incorrect on Word Lists 
Words Correct on Passages 

Words Incorrect on Passages 



Spel 1 ing 

Words Spelled Correctly 



Words Spelled Incorrectly 

Correct Letter Sequences 

Written Expression 
Total Words Written 

Words Spelled Correctly 

Words Spel 1 ed- Incorrectly 

Correct Letter Sequences 



Fall 
Spr 

Fall 
Spr 

Fall 

Spr 

Fall 3 
Spr a 

Fall 
Spr 

Fall 3 

Spr 

Fall 
Spr 

Fall 
Spr 

Fall 
Spr 

Fall 3 
Spr 3 

Fall 
Spr 



2 
2 

2 
2 

2 

2 

2 
2 



3 
3 



4 
4 



Grades ranked from highest to lowest mean scores. 



6 
6 



6 
6 



6 
_6 

6 
_6 

_4 
4 



•V 



Table 5 

Spring - Fall Difference Scores for Seven Measures of Correct Performance 



Grade 



Measure 

Reading 
Word Lists 
Passages 

Spel ling 
Words 

Letter Sequences 

Written Expression 
Total Words- 
Words Correct 
Letter Sequences 



13.6*** 
25.1*** 

4.3*** 
28.6— 

7.8*** 
6.5*** 
34.4*** 



15.5*** 
24 b 7*** 

5.7*** 
37.7*** 

8.9*** 
8.0*** 
17.6*** 



13.1*** 
20.2*** 

5.7*** 
34.9*** 

5.0* 
5.2** 
28.9*** 



10.2*** 
13.6*** 

5.0*** 
29.4*** 

3.8*** 
4 ] *** 

19.2*** 



9.2*** 
13.5*** 

4.2*** 
30.6*** 

5.3*** 
5.7*** 
25.0*** 



7.2* 
11.5** 

4.8*** 
31.4*** 

1.7 

3.7*** 

"j g 7*** 



Significance levels are denoted as follows: 
*E=.05 

**£=.01 
***£=. 001 
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Table 6 

Spring - Fall Difference Scores for Four Measures of Incorrect Performance 



Measure 



Reading 

Word List Errors 
Passages Errors 
Spelling 
Incorrect Words 
Written Expression 
Incorrect Words 



Grade 



-5.9*** 
■14.8*** 

-0.4 

1.2** 



•3.2** 
-3.8*** 

-0.7 

-0.9 



-1.4*** -1.2* -1.0*** 
-1.1 -0.9*** -0.3 



-1 .0 



-0.2 



Significance levels are denoted as follows; 
*£=.05 

**£=.01 

***£=. 001 



-1 .5" 



-0.2 



-1 .9" 



-0.5 



-1 .0* 
-1 .2 

-0.9 

-2.5 



ERJC 



25 

Table 7 

Stability Coefficients on Standard Tasks of Reading, Spelling, 
and Written Expression by Grade Level 





Variable 


1 


2 


Grade 
3 


4 


5 


6 


Entire 


Reading 










- 






Words Correct-- 
Word Lists 


.58 


.91 


.92 


.88 


.94 


.61 


.89 


Words Incorrect- 
Word Lists 


.45 


.56 


.74 


.58 


.86 


- .68 


.65 


Wnrd^ PnrrPft-- 

Passages 


67 


93 


91 


88 


.87 


72 

■ / Cm 


.92 


Words Incorrect- 
Passages 


.23 


.49 


.52 


.61 


.87 


.18 


.47 


Sppll ing 
















Words Corr&ct 


. .21 


.86 


.85 


.84 


.84 


.79 


.91 


Words Incorrect 


.29 


.66 


.35 


.26 


.34 


.35 


.38 


Correct Letter 

Sequences 

i 


.41 


.77 


.68 


.77 


.56 


.78 


.86. 


i 

Writing 
















Total Worlds 


.47 


.59 


.27 


.72 


.55 


.49 


.70 


Words Correct 


.20 


.68 


.37 


.74 


.60 


.78 


.82 


Words Incprrect 


.43 


.11 


.16 


.58 


.37 


.04 


.07 


Correct Letter 
Sequences 


.36 


.70 


.64 


.74 


.64 


.79 


.86 



] 



i 

it 
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Figure 1 . 

160 TOTAL NUMBER OF WORDS READ CORRECTLY 
IN ONE MINUTE FROM 3RD GRADE READING 
WORD LIST FOR 25TH, 50TH AND 75TH 
140-1 PERCENTILE 

Fall 



120 - 




GRADE 
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Figure 2 . 

TOTAL NUMBER OF WORDS READ CORRECTLY 
IN ONE MINUTE FROM 3RD GRADE READING 
WORD LIST FOR 25TH, 50TH AND 75TH 
PERCENTILE 
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Fifure 3. 

TOTAL NUMBER OF WORDS READ 




1 2 3 4 5 6 

GRADE 
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Figure 5. 
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TOTAL NUMBER OF WORDS SPELLED CORRECTLY 
DURING A 3 MINUTE SPELLING TASK FROM 3rd 
GRADE WORDS FOR 25th, 50th, 75th * 
PERCENTILE 

FALL 75% 




3 4 
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Figure 6. 

TOTAL NUMBER OF WORDS SPELLED CORRECTLY 
DURING A 3 MINUTE SPELLING TASK FROM 3rd 
GRADE WORDS FOR 25th, 50th, 75th m 
PERCENTILE 



30- 



20- 
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T 
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~n 

3 
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Figure 7, 



280- 



TOTAL NUMBER OF LETTER SEQUENCES 
SPELLED CORRECTLY DURING A 3 MINUTE 
SPELLING TASK FROM 3 rd GRADE WORDS 
FOR 25 th, 50 th AND 75 th PERCENTILE 



Fall 



240- 



200- 



160- 
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80- 
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Figure 8, 

TOTAL NUMBER OF LETTER SEQUENCES 
SPELLED CORRECTLY DURING A 3 MINUTE 
SPELLING TASK FROM 3rd GRADE WORDS 

FOR 25th 5-50 th AND 75 th PERCENTILE 
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Figure 9 

TOTAL NUMBER OF WORDS WRITTEN DURING 
A 3 MINUTE WRITING TASK FOR 25th , 50th , 
75 th PERCENTILE' 



FALL 




35 



figure 1Q, 

TOTAL NUMBER OF WORDS WRITTEN DURING 
A 3 .MINUTE WRITING TASK FOR 25th , 50th, 
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Figure 11 



TOTAL NUMBER OF WORDS SPELLED 
CORRECTLY DURING A 3 MINUTE WRITING 
TASK FOR 25th , 50th , 75th PERCENTILE 
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Figure 12. 

TOTAL NUMBER OF WORDS SPELLED 
CORRECTLY DURING A 3 MINUTE WRITING 
TASK FOR 25th , 50th , 75th PERCENTILE 




GRADE 
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Figure 13. 

TOTAL NUMBER OF LETTER SEQUENCES 
SPELLED CORRECTLY DURING A 3 MINUTE 
WRITING TASK FOR 25th, 50th, 75th 
PERCENTILE 
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Figure 14. 

TOTAL NUMBER OF LETTER SEQUENCES 
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